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® Biologically active proteins and a method for their production. 



A method for preparing polypeptides in bacteria with an 
alanine rather than a methionine at the N-terminus. The DNA 
sequence expressed has an alanine codon immediately fol- 
lowing from at least one methionine codon, and allows for the" 
expression of polypeptides having the amino acid sequence 
essentially the same as such naturally occurring eucaryotic 
protein as various bovine and porcine somatotropin species. 
In another important embodiment the invention provides a 
class of compositions comprising certain valine-containing 
somatropin species which increase milk production in lactating 
mammals to a significantly greater extent than othenwise- 
identical leucine somatotropin species. 
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BIOLOGICALLY ACTIVE PROTEINS AND A METHOD FOR THEIR PRODUCTION 



This application is a continuation-in-part of pending 
U.S. Patent Application serial number 704,362 filed on 
February 22, 1985. 

TECHNICAL FIELD 

The present invention is directed to a method of ex- 
pressing heterologous DMA sequences (e.g. eucaryotic 
genes) in procaryotic organisms. In one important embodi- 
ment, the invention is directed to production in procaryotes 
of polypeptides having an N -terminal alanine and to various 
N*alanyi polypeptides that can be produced by that method. 
Also included is a class of valine bGH (bovine growth 
hormone) species that enhance milk production to an un- 
expectedly great extent. 

BACKGROUND OF THE INVENTION 

Expression of genes by eucaryotes and procaryotes, 
while sharing the same basic steps of gene transcription 
into messenger RNA (mRNA) and subsequent translation of 
that mRNA into proteins, employ different sets of intracel- 
lular controls for these steps. 

Additionally, in eucaryotes many mature proteins are 
first translated as pre-proteins: Le.. polypeptides comprised 
of the mature protein's sequence fused to a leader or signal 
sequence. Eucaryotic mRNA encodes the entire pre-protein, 
which is processed after translation to remove the leader 
sequence and provide the mature protein. White eucaryotic 
cells are equipped to specifically process such pre-proteins 
into mature proteins, procaryotic cells are generally not able 
to recognize the processing signals present in eucaryotic 
proteins. Thus, if complete complementary DNA (cDNA) 
transcripts of eucaryotic mRNA are employed as the DNA 
sequences for expression in procaryotes, the pre-protein, 
not the mature protein, is found. It is possible to convert 
pre-proteins to mature proteins 'in vitro , but not without 
significant expense. 

In the event that the DNA sequence encoding the 
mature protein is used for mature protein expression in 
procaryotes, this sequence will be lacking the eucaryotic 
translation and post- translation processing signals usually 
contained within the DNA for the leader sequence. There- 
fore, for expression of cioried eucaryotic genes or other 
heterologous DNA sequences in procaryotic systems, it has 
proven desirable to employ procaryotic control signals for 
reasons of efficiency and because eucaryotic signals may 
not be recognized by a procaryotic host cell. 

The term "heterologous DNA" is defined herein as 
DNA at least a portion of which is not normally contained 
within the genome of the host cell. Examples of heterolo- 
gous DNA include, but are not limited to, viral and 
eucaryotic genes, gene fragments, alleles and synthetic 
DNA sequences. The term "heterologous protein" or 
"heterologous polypeptide" is defined herein as a protein or 
polypeptide at least a portion of which is not normally 
encoded within the genome of the host cell. 

The procar/otic control signals include a promoter 
which signals the initiation of transcription and translation 
control signals comprising a ribosome binding site, a trans- 
lation start signal and a translation stop signal. All of these 
signals except the translation stop signal must be situated in 
front of the eucaryotic gene or other DNA to be expressed. 



The art has adopted several approaches to expressing 
heterologous DNA (e.g. eucaryotic genes) in procaryotes. In 
one approach, the DNA segment encoding the resultant 
protein is ligated to the DNA encoding all or part of a 

5 bacterial protein under the control of its bacterial promoter. 
The endogenous procaryotic DNA necessarily also contains 
the ribosome binding site and translation start signal. Ex- 
pression of such ligated DNA results in what is called a 
fusion protein comprising the eucaryotic polypeptide linked 

10 or fused to a whole or partial bacterial protein. Isolation of 
the eucaryotic protein may then be achieved by site-specific 
enzymatic or chemical cleavage at the endogenous- 
eucaryotic protein fusion site or by selective degradation of 
the procaryotic polypeptide sequences. 

7 5 Examples of published works relating to the production 

in bacteria of eucaryotic fusion proteins include European 
Application 47.600 (published March 17, 1982) which re- 
fers to fusion and non-fusion proteins comprising bovine 
pre-growth hormone or bovine growth hormone ("bGH") at 

20 the carboxy(C-) terminus with or without a portion of a 
procaryotic protein at the amino (N-) terminus; U.K. Patent 
Application GB 2.073,245A (published October 14, 1981) 
referring to fusion proteins of bGH and E.colt iS-lactamase; 
E. Keshet et al.. Nucleic Acid Research , 9:19-30 (1981) 

25 referring to a fusion protein of bGH and E^ coli ^-lactamase; 
European Patent Application 95.361 (published November 
30, 1983) referring to a fusion protein comprising, in se- 
quence, an endogenous protein at the N -terminus, a trans- 
lation start signal amino acid, an enterokinase cleavage site, 

30 and an exogenous protein (e.g. growth hormone) at the C- 
terminus. This fusion protein approach, however, is cum- 
bersome in that it requires in vitro p rocessing following 
purification, and the cost of enzyme(s) for processing com- 
mercial quantities can be prohibitive. 

35 Fusion proteins, however, have become an attractive 

system for expressing some eucaryotic genes or other het- 
erologous DNA in procaryotic cells, as the fusion product 
appears to protect some of the resulting heterologous pro- 
teins from intracellular degradation. Bacterial cells appear to 

40 recognize some eucaryotic proteins produced therein as 
foreign and, thus, proceed to degrade these proteins as 
soon as the proteins are made or shortly thereafter. Fusion 
proteins engineered for protective purposes can employ 
endogenous polypeptide sequences at either the amino or 

45 carboxy terminus of the heterologous protein. An example of 
the tatter approach is found in European Patent Application 
111,814 (published June 27, 1984) describing fusion pro- 
teins comprising a form of bGH having a synthetic front-end 
(amino-terminus) and E, coli /3-galactosidase at the C-termi- 

so nus. The advantages are, again, diminished by the need to 
subsequently cleave the heterologous protein from the en- 
dogenous polypeptide as discussed above. 

In another approach, the translation start signal, ATG, 
under the control of a bacterial promoter, is located imme- 

55 diately preceding the DNA sequence encoding a heterolo- 
gous (e.g. eucaryotic) protein free from endogenous protein 
at both the N-and C-termini of the protein produced. Al- 
though the proteins produced by such gene constructs do 
not require subsequent cleavage to generate the desired 

60 protein, they typically include a methionine (in some cases 
a formyl-methionine) at the N -terminus as the ATG start 
signal is also a methionine codon. Thus, unless the desired 
mature protein begins with methionine, the protein will now 
have an N -terminus altered by inclusion of that methionine 

65 residue. 
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Examples of such gene constructs include Guarente et 
al.." Ceii (1980) 20:543-553 wherein the rabbit /3-g!obin 
gene, which possesses an N-terminal valine, is expressed in 
E. coljemploying the gene construct just described. The 
investigators found that whereas "In rabbit ^3-globin, there is 
no amino terminal methionine, and leucines are found at 
positions 3, 14. 28, 31. 32 ....In the labeled protein, 
leucines were found at positions 4, 15, 29, 32 and 33, and 
a methionine was found at position i . This result shows that 
the protein is rabbit ^-globin plus an amino terminal 
methionine which is not removed in E.coii ." Id at 546-547. 

Another example relates to the production of growth 
hormones in bacteria employing the above-described gene 
construct Schoner et al. Proc. Nat'l.Acad. Sci. U.S.A. - 
(1984) 81:5403-5407 describes a high level expression 
system in bacteria for production of bGH which results in 
production of an N-methionyl bGH; that is, a compound 
having an amino acid sequence like that of one of the 
naturally-occurring bGH species plus a methionine at its N- 
terminus. The addition of an N-terminal methionine to var- 
ious growth hormone species produced in bacteria is again 
discussed in European Patent Application 103,395 
(published f^arch 21, 1984) and European Patent Applica- 
tion 75,444 (published March 30, 1983), for bGH, and 
Seeburg et al., DNA (1983) 2:37-45, for b(3H and porcine 
growth hormone ("pGH"). 

Addition of an N-terminal methionine to the natural N- 
terminus may be undesirable for several reasons. First, it is 
possible (although currently believed unlikely) that the 
methionine may tend to make the protein antigenic in an 
organism in which the protein without the N -methionine is 
endogenous. Second, the addition of methionine to the N- 
terminal portion of the protein may have an undesirable 
effect on its bioactivity or its physical properties. Third, this 
altered form of the protein may hinder scientific efforts in 
determining the relationship of the natural protein's function 
to its structure. Further, it may be advantageous to have a 
biosynthetic protein which is structurally as close as possi- 
ble to a naturally-occurring protein when applying for gov- 
ernmental approval of medical or veterinary applications. 

The ability of such procaryotes as bacteria to remove 
the N- terminal methionine from proteins either during their 
production or thereafter has been a topic of considerable 
interest. For example. Waller, J. Mol. Biol. (1963) 7:483- 
496, examined the N-terminal amino acid composition of 
"soluble" and ribosomal proteins from a cell-free extract of 
E.coii . and European Patent Application 103,395 - 
(published March 2i. 1984) discloses the removal of an N- 
terminal methionine from an eucar/otic protein produced in 
E. coli. Specifically, methionine is removed from one of two 
bacterially- produced bGH proteins both of which contain a 
serine residue immediately following to the originally- present 
N-terminal methionine. The gene construct employed in 
these studies, however, comprised a synthetic start se- 
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quence coding for 5'-methionine-serine-leucine-3' inserted 
immediately adjacent to the 5' end of a bGH coding se- 
quence in which bases coding for the first 4 or 9 naturally- 
occurring amino acids had been deleted. Thus, the resultant 
5 protein produced in coliwas a non-naturally-occurring 
protein. U.K. Patent Application 2,073, 245A (published Oc- 
tober 14, 1981) discloses that when met pro replaces ala in 
the mature bGH protein, "Met can be processed by bac- 
teria to give modified bGH starting with the amino acid 

w sequence Pro Phe Ala Pro". 

There is, thus, a need to develop an economic and 
predictable means for producing in such microorganisms as 
bacteria heterologous (e.g. eucar/otic) proteins that do not 
have an N-terminal methionine. Specifically, it is especially 

7 5 desirable to develop a method whereby such proteins pro- 
duced in bacteria do not require in vitro , post-fermentation 
processing and do not contain an additional, non-naturally 
occurring N-terminal methionine. 

Growth hormones (also called somatotropins) are poly- 

20 peptides produced and secreted by cells of the pituitary 
gland and are largely species -specific in their actions. In 
addition to their role in promoting skeletal growth, growth 
hormones affect a variety of metabolic processes including 
the stimulation of lactation, increased insulin release from 

35 the pancreas and glucagon secretion, and they exert a lipid - 
mobilizing effect Exogenous administration of bGH to cattle, 
for example, has been demonstrated to increase milk pro- 
duction, feed efficiency and/or growth rate, decrease fatten- 
ing time and increase the lean- to- fat ratio. It is still not fully 

30 understood, however, how the hormone exerts these mul- 
tiple effects. 

Extensive work with human growth hormone (hGH) has 
established that the hormone, as secreted by the human 
pituitary gland, is not a single molecular entity but a mixture 

35 of polypeptides- Fractionation of the various hGH species 
has resulted in the preparation of some hGH fractions with 
neither diabetogenic or lipolytic activities. 

Similarly, bGH is produced by cattle in multiple forms. 
Specifically, four forms of bGH are produced which differ at 

40 two positions of the protein. The N-terminal amino acid can 
vary due to a presumed ambiguity in removal of the signal- 
peptide (leader) sequence so that the mature protein begins 
with either NHa-phe-pro or NH,-ala-phe-pro. In addition, 
there is a heterogeneity at amino acid 126 being either a 

45 leucine or a valine. This is apparently due to an allelic 
variation in the bovine population. Wallis (1969) FEBS 
Letters 3:118-120; Fellows and Rogol (1969) J. Biol. 
Chem244: 1567-1 575; Fernandez et al. (I97i) FEBS Let- 
ters 18:53-54, Fellows (1973) Personal comments in Re- 

50 cent Prooress in HormoneResearch 29:404; Santome • 
(1973) Eur. J. Biochem. 37:164-170; Graf and Li (1974) 
Biochem. Biophvs. Res. Comm. 56:168-176. The four mo- 
lecular forms (species) of pituitary bGH are herein des- 
ignated and abbreviated as follows: 

55 



Abbr . Structure 

bGH(L) NH2-phe(l)-pro(2) . . .leu(126) . . .COOH 

bGH(A,L) NH2-ala(-l)-phe(l)-pro(2) . . .leu(126) . . .COOH 

bGH(V) NH2-ptie(l)-pro(2) . . .val(126) . . .COOH 

bGH(A,V) NH2-ala(-l)-phe(l)-pro(2) . . .val(126) . . .COOH 
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The bGH{A.V) and bGH(V) species are sometime^ 
collectively referred to herein as "valine allelic bGH spe- 
cies" or "valine bGH species". Similarly, the bGH (A,L) 
and bGH(L) species are sometimes collectively referred to 
herein as "leucine allelic bGH species" or "leucine bGH 
species". The numbers assigned to the aniino acids in the 
bGH proteins as abbreviated above are for purposes of 
identification and reference only. 

Mills et al- (1970), J. Biol. Chem. 245:3407-34i 5. simi- 
larly identified two cyanogen bromide fragments of porcine 
growth hormone (pGH) exhibiting a heterogeneity at their 
respective N-termini. Specifically, one fragment contained 
an N -terminal phenylalanine and the other an additional N- 
terminal alanine. These molecular forms of pGH are herein 
abbreviated as pGH(P) and pGH(A), respectively. 

The entire DNA coding sequences and corresponding 
amino acid sequences for bGH(L) and pGH{P) have been 
published by Seeburg et al., DNA(1983) 2:37-45, which is 
incorporated herein by reference. 

Pituitary cells of individual cattle have been generally 
found to contain a mixture of at least bGH(A,L) and bGH- 
(L), or bGH(A.V) and bGH{V). N-terminal analysis of bGH 
proteins produced in cultured pituitary cells shows an ap- 
proximately 50:50 mixture of molecules containing either an 
N-terminal phenylalanine (phe) or an N-terminal alanine - 
(ala). Commercially available preparations, made from 
pooled pituitaries of many cattle, generally include all four 
molecular forms of pituitar/ bGH. Furthermore, it has been 
reported that about thirty percent of the molecules in bGH 
preparations obtained from pooled glands have valine re- 
placing leucine at position 126. Ferandez et al. (1971) 
FEBS Letters 18:53-54. Standard biochemical methods for 
separating the four known bGH forms do not permit produc- 
tion of each or any one of these species on a commercial 
scale. Differing biological activities of these four forms of 
bGH would be desirably studied and made commercially 
available using each of the forms essentially free of one or 
more of the other three forms, and/or free of other proteins 
of bovine. origin. The term "essentially pure" as used herein 
to describe a specific protein or proteins means essentially 
free from proteins and/or other substances with which the 
protein(s) is (are) associated in a natural environment or 
source. For those and other purposes, the objects of this 
invention include a method whereby at least some of those 
individual forms of bGH can be conveniently produced. 

Accordingly, it is an object, of the present invention to 
produce in procaryotes eucaryotic or other heterologous 
polypeptides which do not have a methionine residue at the 
N -terminus. 

It is another object of the invention to produce in 
procaryotes eucaryotic or other heterologous polypeptides 
that do not require in vitroprocessing to remove a 
methionine from the N -terminus. 

Still another object of the invention is to provide a 
method for production in procaryotes of eucaryotic or other 
heterologous polyF)eptides that have an N-terminal alanine 
without the need for invitro processing. 

Another object of the invention is to produce in pro- 
caryotes eucaryotic or other heterologous polypeptides that 
have the amino acid sequence essentially the same as a 
naturally-occurring protein which does not have an N- termi- 
nal methionine. 

A further object of the invention is to provide a method 
of producing in procaryotes polypeptides having the amino 
acid sequence found in mature eucaryotic polypeptides 
such as bGH(A.L). bGH(A,V) and pGH(A). 



It is a still further object of the invention to provide 
bGH(A,L), bGH(A,V) or pGH{A) substantially free of pro- 
teins of bovine or porcine origin, respectively. 

It is yet a further object of the present invention to 
5 provide a class of bGH species which enhance milk produc- 
tion in cows to an advantageously greater extent. 

The bGH polypeptides produced by methods of the 
present invention provide a means for potentiating such 
somatotropin activities as increased milk production, growth 
70 rate and/or feed efficiency. 

These and other objects of the invention will be more 
fully apparent from the following general and detailed de- 
scription of the invention. 

75 SUMMARY OF THE INVENTION 

Objects of this invention are achieved, in one embodi- 
ment, by a method for bacterialiy producing a heterologous 
polypeptide having an N-terminal alanine which comprises 

20 causing expression of genomic DNA in selected bacteria, 
said DNA containing from one to about three contiguous 
methionine codons including a translation start signal, fol- 
lowed immediately by the codons for said polypeptide, fol- 
lowed by a translation stop signal codon, and recovering the 

25 resulting heterologous polypeptide having an N-terminal al- 
anine produced within said bacteha. 

In another embodiment, the present invention provides 
a method of producing a heterologous polypeptide having 
an N-terminal alanine comprising causing expression of 

30 genomic DNA in selected bacteria, said DNA containing a 
translation start signal/methionine codon immediately fol- 
lowed by the codons for said polypeptide, followed by a 
translation stop signal codon, and recovering the resulting 
heterologous polypeptide having an N-terminal alanine pro- 

35 duced within said bacteria 

In yet another embodiment, the present invention pro- 
vides a method for preparing in bacteria a heterologous 
polypeptide containing an amino acid sequence essentially 
the same as that of a naturally occurring eucaryotic poly- 

40 peptide. 

In other embodiments, the present invention provides 
various compositions comprising such somatotropins as 
bGH{A,L). bGH(A,V), pGH(A). and mixtures of bGH(A,L) 
and bGH{A,V). substantially free of polypeptides of bovine 

45 or porcine origin, respectively, or of other somatotropin 
species. 

In still another embodiment, it was discovered that 
compositions comprising a class of valine bGH species 
enhance milk production in cows to a greater extent than 

50 otherwise-identical leucine bGH species or bGH(A,L). In 
one preferred embodiment, compositions able to achieve 
such enhanced milk production, which compositions com- 
prise essentially pure valine bGH species, are provided. 
Other embodiments include various genes, DNA vec- 

55 tors and transformed bacteria useful in the aforementioned 
methods, as well as certain methods utilizing the aforemen- 
tioned compositions for enhancing bovine and/or porcine 
lactation, pre-adutt growth, and/or feed conversion efficien- 
cy. 

60 

BRIEF DESCRIPTION OF THE DRAWINGS 

In the following diagramatic representations, the 
hatched box represents the DNA coding sequence for a 
65 bacterial promoter, the blackened box represents a heterolo- 
gous DNA coding sequence, the t»ars represent additional 
DNA coding sequences as labeled, and the directional 
arrow represents the 5' to 3" orientation of the DNA coding 
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sequences. Relevant restriction endonuclease sites are also 
shown. The DNA regions so marked are for purposes of 
diagramatic representation only and are not drawn to scale. 

Fig. 1 depicts the construction of Ml3mp8/Xbal comprising 
a Ml3mp8 vector having inserted therein at the Smal 
restriction endonuclease site an Xbal restriction en- 
donuclease site. 

Fig. 2 depicts the construction of Ml3mp8/BGHex-i compri- 
sing Ml3mp8/Xbal carrying a bGH(L) DNA coding se- 
quence. 

Fig. 3 depicts the creation of a bGH(A,L) DNA coding 
sequence by oligonucleotide-directed site-specific mutagen- 
sis. 

Fig. 4 depicts the creation of a bGH{A,V) DNA coding 
sequence by oligonucleotide-directed site-specific 
mutagenesis. 

Fig. 5 depicts the construction of a pfvlON3209 expression 
vector comprising pBGHex-i carrying a bGH(A,L) DNA cod- 
ing sequence in place of a bGH{L) DNA coding sequence. 

Fig. 6 depicts the construction of a pMON32l5 expression 
vector comprising pBGHex-i carrying a bGH{A,V) DNA cod- 
ing sequence in place of a bGH(L) DNA coding sequence. 

Fig. 7 depicts the construction of Mi3mp9/PGHej<-i compri- 
sing Ml3mp9 carrying a pG H(P) DNA coding sequence- 
Fig. 8 depicts the creation of a pGH(A) DNA coding se- 
quence by oligonucleotide-directed site-specific 
mutagenesis. 

Fig. 9 depicts the construction of pBGHex-t' comprising 
pBGHex-i wherein the EcoRI restriction endonuclease site 
located upstream from the 5' end of the ptrp DNA coding 
sequence has been removed. 

Fig. 10 depicts the construction of a pf^ON32l3 expression 
vector comprising pBGHex.r carrying a pPGH(A) DNA 
coding sequence in place of a bGH{L) DNA coding se- 
quence. 



DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a method for producing 
in a procaryote a heterologous polypeptide, such as a 
eucaryotic (e.g. mammalian or avian) protein, that has an 
N -terminal alanine. Thus the polypeptide is produced with- 
out an N-terminal methionine, and the need for in 
vitroo rocessino to produce such a polypeptide without an N- 
terminal methionine is eliminated. The consistent production 
of such a polypeptide lacking an N-terminal methionine 
present In the gene ccxjing sequence is a novel and quite 
unexpected result 

The present invention provides a valuable method for 
production of essentially pure proteins that have an N- 
terminal alanine. Such proteins include, without limitation, 
given species of bovine and porcine somatotropin and vari- 
ants thereof, the plant proteins ribulose-i.5-bisphosphate 
carboxylase small subunit, glutathione S-tranferase, and 
heat shock protein 70. The invention is also useful for the 
production of other polypeptides where it is desirable to 



have an alanine at the N -terminus rather than a methionine. 
It can be desirable to have an N-terminal alanine rather 
than an N-terminal methionine because, inter alia, the N- 
alanyl form of the polypeptide may be less immunogenic, or 

5 have different physical properties or a modified bioactivity. 

In the examples of the present invention wherein es- 
sentially pure species of bGH or pGH are produced by 
bacterial cells, evidence and teachings for removal of the 
N-terminal methionine are clearly lacking. Indeed, all reports 

10 of bGH expression by bacterial cells wherein the N-terminus 
comprises an N-terminal amino acid sequence homologous 
to naturally-occurring bGH species report the presence of 
an N-terminal methionine. In Seeburg et aL, DNA (1983) 
2:37-45 at 44, the gene sequence for the N-terminal 

75 phenylalanine species of bGH, e.g. bGH(L), was delib- 
erately chosen for expression in E. coli, in part to avoid the 
expected addition of a second hydrophobic amino acid - 
(methionine) to the hydrophobic N-terminal alanine of the 
other bGH species, i.e., bGH(A,L). Thus, available studies 

20 to date teach a retention of the N-terminal methionine in 
bacterial ty- produced species of bGH. Notwithstanding these 
reports, I determined that for the reasons discussed above, 
there exists a need to produce each of the two bGH 
species, bGH(A,L) and bGH(A,V), and the pGH species 

25 pGH(A). However, attempts to produce these somatotropin 
species in bacteria were undertaken with the expectation 
that the polypeptides produced would contain an N-terminal 
methionine. 

As detailed in the examples of the present invention, 

30 my approach to bGH(A,L), bGH(A,V) and pGH(A) produc- 
tion in bacteria was briefly as follows. The DNA coding 
sequences for the aforementioned proteins were construct- 
ed by oligonucleotide-directed site-specific mutagenesis of 
the DNA coding sequences for bovine and porcine 

35 somatotropin species containing an N-terminal 
phenylalanine as shown in Figs. 3, 4 and 8. Thereafter, the 
bGH(A,L), bGH(A,V) and pGH(A) coding sequences were 
inserted into expression vectors such that the final gene 
sequence comprised, sequentially, a promoter, a ribosome 

40 binding site, an ATG start/methionine codon immediately 
adjacent the DNA coding sequence for either bGH(A,L), 
bGH(A,V) or pGH(A). and a translation stop codon. E.coli 
were then transfected with a given expression vector carry- 
ing the desired gene sequence and cultured under con- 

45 ditions which would allow for expression of the desired 
heterologous DNA and production of the desired heterolo- 
gous protein. The proteins so produced were then sequen- 
ced and assayed for their appropriate biologic activity. 

Thus, it has been discovered here that when a DNA 

50 sequence containing a start signal/methionine codon fol- 
lowed immediately by the codons for a heterologous N- 
alanyl polypeptide is expressed, the protein recovered from 
the procaryotic organism actually has alanine at the N- 
terminus, not a methionine. It is believed that a similar result 

55 is obtainable when that alanine codon is immediately pre- 
ceded by up to about three contiguous methionine codons 
which include the start signal for translation of the mRNA 
encoding the desired polypeptide product. For example, the 
DNA including that translation start signal and the codons 

60 for the desired polypeptide product can include any se- 
quence appropriately coding for rnet ala, met met ala, met 
met met ala, or any functional equivalent thereof. 

An N-alanyl polypeptide is herein defined as a poly- 
peptide having an alanine at its amino terminus. While 

65 applicant does not wish to be bound by the following theory 
of mechanism, it is believed that subsequent to translation, 
the N-terminal methionine of the polypeptide is enzymatical- 
ly removed by the procaryote if the next amino acid is 
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alanine or another amino acid having similar characteristics 
(e.g. polarity or hydrophobicity) that facilitate such removal 
of the N -methionine. It is furthermore believed that many 
procaryotes are capable of removing N -terminal methionines 
from heterologous and/or endogenous polypeptides when 
said N -terminal methionines are immediately followed by an 
alanine. 

These procaryotes (e.g. various bacteria known and 
publicly available through recognized microorganism deposi- 
tor institutions such as the ATCC or otherwise) include but 
are believed not limited to E coli and various strains there- 
of. Indeed, it is contemplated that any procaryote capable of 
producing a polypeptide having an N -terminal alanine from 
a DNA coding sequence which begins with from one to 
about three contiguous methionine codons immediately fol- 
lowed by an alanine codon is potentially useful in the 
practice of the invention disclosed herein. Commercially and 
otherwise available procaryotic organisms may be screened 
for their ability to produce such N-alanyl polypeptides by 
inserting into the genome of said organisms a gene com- 
prising sequentially a promoter operative in said organism, 
DNA encoding a ribosome binding site, from one to about 
three contiguous methionine codons including a translation 
start signal followed immediately by the codons for a het- 
erologous N-alanyl polypeptide and a translation stop signal, 
thereafter causing expression of said gene and then deter- 
mining the N -terminal amino acid sequence of the heterolo- 
gous polypeptide so produced. When such a procar/otic 
organism has been found to produce internally such a 
heterologous N-alanyl polypeptide, that organism is within 
the class defined as "selected" for purposes of the disclo- 
sure and claims herein. 

In a preferred embodiment of the present invention, 
three different isolates of E.coli strain K12 alt of which have 
been deposited with the American Type Culture Collection, 
Rockville, Maryland, and have been assigned accession 
numbers ATCC 39936. 53010, and 53009 demonstrate an 
ability to remove N-terminal methionines when said N- 
terminal methionines are immediately followed by an al- 
anine. 

The present discovery is significant since it provides a 
method of producing in procaryotes heterologous polypep- 
tides that have N-terminal alanines. 

In one of the preferred embodiments, the method of the 
present invention is employed to generate two bGH species, 
bGH{A,L) and bGH{A.V). and one pGH species, pGH(A), 
free from other bovine or porcine proteins and/or other bGH 
or pGH species, respectively. Specifically, the method pro- 
vides for production of bGH(A,L). bGH(A,V) or pGH(A) as 
single species. The ability to produce a single bGH or pGH 
species with amino acid sequences homologous to naturally 
occurring somatotropins has tremendous import for deter- 
mining the precise bioreactivity of each bGH or pGH spe- 
cies given the numerous potentiating effects of bGH and 
pGH generally as cited above. In fact, it has been found 
that administering either of the two N-alanyl bGH species 
yields a potentiation of such bGH functions as milk produc- 
tion. Further, it has been found that administration of a 
lactation-enhancing amount of bGH(A,V) produced in accor- 
dance with the present invention increases milk production 
in dairy cows to a statistically (e.g. p<0.05) greater extent 
than similarly produced bGH(A.L). To date, there has been 
no teaching of the relative effectiveness of specific bGH 
forms (species) in enhancing milk production in lactating 
mammals. Furthermore, there has been no prior result, in 
vivo or in_ vitro , suggesting that a difference in bioactivity 
between bGH species would be observed. Thus, the finding 
that the valine allelic bGH species result in a greater 



enhancement in milk production than the leucine allelic bGH 
species was both surprising and unexpected. This finding 
further suggests that differences in the relative effectiveness 
of bGH species in potentiating such other growth hormone 
5 properties as increased feed efficiency and growth stimula- 
tion can similarly be defined. Thus, by employing the meth- 
ods of the present invention which permit the production in 
bacteria of at least two of the molecular forms of pituitary 
bGH is essentially pure form and/or by employing other 
70 available technologies, the bGH species most effective in 
achieving a specific growth hormone induced response can 
now be produced. Such other available technologies in- 
clude, but are not limited to, chemical synthesis of entire 
bGH proteins or fragments thereof, and/or production in 
75 Other microorganisms such as yeast or in mammalian cells 
using known recombinant DNA techniques. 

Additionally, once the bioactivity of each naturally-oc- 
curring species is determined, it is considered feasible to 
generate polypeptide variants of each of such species 
20 which would further increase their somatotropin activity. It 
is, therefore, anticipated that the generation of somatotropin 
variants by nucleotide or amino acid deletion, substitution 
and/or addition will provide various useful equivalents of the 
compositions disclosed herein. Such variants include an 
25 altered form of bGH(V) wherein the alteration comprises the 
addition of a methionine to the amino terminus. This variant, 
produced by genetically transformed bacteria, has also been 
found to increase lactation in dairy cows to a surprisingly 
great extent when administered in a lactation-enhancing 
30 amount. Furthermore, the extent of the observed increase in 
lactation pursuant to administration of this bGH(V) variant 
was found to be measurably greater than the lactation- 
enhancement observed pursuant to administration of an 
otherwise identical variant containing leucine at amino acid 
35 position 126. 

While applicant does not wish to be bound by the 
following theory of mechanism, it appears that the substitu- 
tion of valine for leucine at position 126 significantly en- 
hances the bioavailability and/or bioreactivity of such 
40 somatrotropin protein(s). Specifically, x-ray crystallography 
performed on somatotropin proteins indicates that the region 
of said proteins from about amino acid 90 to about amino 
acid 135 constitutes a relatively flexible region. Studies of 
other proteins have shown that flexible regions often com- 
45 prise a bioreactive site (e.g. a site which interacts with 
biologic receptors and/or other portions of the same protein 
to achieve a bioactive form). Thus, it is anticipated that 
additional variants of t»GH (A, V), which variants comprise 
amino acid substitutions, deletions, additions and/or inver- 
50 sions within this described flexible region and/or outside this 
region that likewise result in a measurable enhancement of 
lactation (e.g. milk production) to a greater extent than 
otherwise-identical leucine bGH species or bGH {A,L), can 
be made. For example, changes at or around amino acid 
55 position 1 26 can include a substitution of other more hydro- 
philic and/or smaller amino acids (relative to leucine) provid- 
ing that the described enhancement in milk production is not 
diminished to an intolerable degree. 

Further, it is anticipated that valine bGH species con- 
so taining an N-terminal phe„ or an N-terminal ala^ can 
achieve the described enhancement in milk production 
when administered to cows. It is also anticipated that 
changes in the amino terminus (e.g. met)) which do not 
substantially differ from the N -terminus of naturally-occurring 
65 bGH would not interfere to an intolerable degree with the 
ability of valine bGH species to enhance milk production to 
a measurably greater extent than otherwise-identical leucine 
bGH species of bGH(A,L). it is further anticipated that a 
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bGH composition wherein the concentration of valine bGH 
species, based on the weight of all bGH in the composition, 
is substantially greater than the concentration of valine bGH 
species in pooled pituitary bGH preparation would similarly 
achieve the described result. 

In one of its broadest embodiments, the present inven- 
tion is a refinement in the use of a recombinant DNA 
technology to directly produce in procaryotes heterologous 
polypeptides. Thus, the description of the invention presup- 
poses knowledge of the basic techniques employed in re- 
combinant DNA technology to isolate and clone DNA se- 
quences encoding polypeptides, the rearrangement or alter- 
ing of cloned DNA sequences, and the expression of cloned 
or modified DNA sequences in transformed microorganisms. 
Such techniques are within the skill of the art. (See. e.g. . 
MolecularClonino: A Laboratory Manual : Maniatis, Fritsch & 
Sambrook eds. 1982). 

ISOLATION AND/OR CONSTRUCTION OF HETEROLO- 
GOUS DNA 

In one embodiment of the present invention, the DNA 
sequence coding for the desired heterologous polypeptide to 
be produced in the procaryote is chosen and isolated, or a 
DNA sequence that encodes it is constructed or chemically 
synthesized. In many important embodiments, the polypep- 
tide is a eucaryotic protein. If the polypeptide is small and 
the complete amino acid sequence known, a synthetic DNA 
molecule or sequence encoding the polypeptide can be 
constructed. If the amino acid sequence of the polypeptide 
is unknown, or its size is too large for the synthesis of a 
corresponding DNA. sequence to be practical, a cDNA 
sequence can be prepared by reverse transcription from 
corresponding mRNA . obtained from tissues or cells ex- 
pressing the polypeptide. For example, in one embodiment 
of the present invention, the sequence for bGH can be so 
obtained from bovine pituitaries by now routine procedures 
described by Goodman et al. Methods in Enzvmoloov 
68:75-90 (1979). Alternatively, a cDNA sequence can be 
prepared from mRNA isolated from cells transformed with 
genomic DNA isolated from a native gene bank with an 
appropriate probe. Genomic DNA can also be modified in 
various vector systems such that the DNA can be ex- 
pressed in procaryotes. These techniques are . within the 
skill of the art 

Once a heterologous DNA sequence containing the 
codons for the desired polypeptide is obtained, it may be 
desirable to make certain modifications in the nucleotide 
sequence of the molecule. For example, if the molecule has 
been produced by reverse transcription from an mRNA 
template, it will often contain at least a portion of the DNA 
encoding the leader sequence of the pre-protein. Thus, it is 
necessary to remove all of the leader-sequence DNA prior 
to the first codon of the desired protein. In some cases it 
may be necessary to add or substitute an alanine codon at 
the beginning of the sequence encoding the desired protein 
if it does not already have an N -terminal alanine codon. 
Then a translation start signal (which is also a methionine 
codon) is introduced upstream and immediately adjacent to 
the alanine codon. While the start signal/methionine codon 
will usually (and preferably) be the nucleotide sequence 
ATG, the sequence GTG can also occasionally serve as a 
start initiation signal/methionine codon. Additionally, the 
presence of more than one methionine codon, e.g. two, 
three or possibly more contiguous codons for methionine, is 
understood to constitute equivalence within the method of 
the present invention. 



If not already present, at least one translation stop 
signal should be introduced after the codon for the C- 
terminal amino acid. Examples of translation stop signals 
include the deoxynucleotide triplets TAA, TGA, and TAG. 
5 In essence, therefore, recombinant DNA techniques are 
applied to construct a recombinant DNA sequence contain- 
ing, sequentially, a translation start signal/methionine codon, 
the codons for the desired polypeptide with the N- terminal 
alanine codon adjacent to the start signal, and at least one 

70 translation stop signal adjacent to the codon for the C- 
terminal amino acid. 

It has been found that efficient expression of mRNA 
can be hindered by secondary structures formed by the 
hydrogen bonding of two complementary series of 

75 nucleotides within the mRNA. Elimination of these com- 
plementary sequences, particularly in that portion of the 
molecule that encodes the N-terminus, facilitates the binding 
of ribosomes to the mRNA and, therefore, increased levels 
of expression. It may be desirable, therefore, to replace 

20 codons that participate in the formation of such secondary 
structures with codons for the same amino acid, but com- 
prised of a different nucleotide triplet See European Patent 
75,444 (published March 30, 1983); Seeburg et al., (1983) 
DNA 2_:37-45 and Shoner et al (1984) Proc. Nat'l. Acad. 

25 Sci. U.S.A. 8l:5403-5407. 

Other approaches to constructing the heterologous 
DNA sequences will be apparent to those skilled in the art. 
For example, if a DNA molecule is available that encodes a 
polypeptide which is expressed with an N -terminal structure 

30 of NH2-met-x-y... where x is an amino' acid other than 
alanine, an alanine codon can be inserted between the 
translation start signal/methionine codon and the codon for 
X. Alternately, the codon for x may be deleted and an 
alanine codon inserted in place thereof. Thus, a protein 

35 having the N-terminal structure NH2-ala-x-y... or NH2-ala- 
y..., respectively, would be produced in the process of the 
present invention. 

Similarly, deletions, additions and/or substitutions in any 
of the amino acid codons within a given gene sequence 

40 may be made so that a variant polypeptide would be 
expressed in the process of the present invention. A 
"variant" polypeptide is defined herein as having single or 
multiple amino acid deletions, substitutions and/or additions 
as compared to the naturally occurring amino acid se- 

45 quence of a given polypeptide. Examples of such variants 
include but are not limited to met-bGH(L) and met-bGH(V) 
wherein the amino acid sequence of these variant bGH 
species are identical to the bGH(L) and bGH(V), respec- 
tively, produced by bovine pituitary cells except for the 

50 presence of an additional methionine at the N-terminus. 
These variant polypeptides are construed as having an 
amino acid sequence essentially the same as that of a 
naturally occurring polypeptide as long as the biologic activ- 
ity is not diminished to an intolerable degree. Creation and 

55 expression of variant polypeptides may be desirable in order 
to achieve increased accumulation, increased protein stabil- 
ity, to facilitate polypeptide purification, and/or in optimizing 
biological activity. 

The above modifications of the DNA molecule encod- 

60 ing the desired polypeptide can be accomplished using 
restriction enzymes, exonucleases, endonucleases, etc. by 
techniques known in the art The general techniques of 
oligonucleotide-directed site-specific mutagenesis can also 
be employed to effect the above modifications in the struc- 

65 ture or sequence of the DNA molecule and are known to 
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those of skill in the art. See e.g. . Zoller & Smith, (1982) 
Nuc . Acids Res. 10:6487-6500; Zoller & Smith, (1983) 
Meth . Enzvmol. 1 00 :468:500: Norrls et al., (1983) Nuc. Ac- 
ids Res. 11:5103-51 12. 

In accordance with recombinant ONA techniques, once 
the desired heterologous DNA sequence is obtained, the 
sequence is then inserted into an appropriate cloning vector 
which provides a means for replicating the DNA sequence. 
Any appropriate cloning vector, preferably containing a 
marker function, may be used, for example E. coli plasmid 
vectors which include Col El, Hershfield et al., Proc. Nat'l. 
Acad. Sci. U.S.A. (1974) 71:3455; pBR322, Bofivar et al.. 
Gene (1977) 2:95: pBR325. Soberon et al.. Gene (1978) 
4:121; and pkc7. Rao et al.. Gene (1979) 7:79; and E. 
colibacteriophage vectors which include Charon XL47.1, 
Loenen et al.. Gene (1980) 10:249; and Ml3mp8 and 
Mi3mp9, Messing et al., Gene (1982) 19:269. The general 
techniques for inserting said DNA sequence into a cloning 
vector to create a recombinant vector are within the skill of 
the art. Seee.g. . Molecular Cloning: A Laboratory Manual : 
Maniatis, Fritsch & Sambrook, eds. (1982). 

Once multiple copies of the desired heterologous DNA 
sequence are obtained, these sequences may be removed 
from the recombinant vectors and inserted into an expres- 
sion system for production and isolation of the desired 
heterologous protein as described more fully below. Modi- 
fications of the heterologous DNA sequence, by methods 
known to those skilled in the art may be made prior to 
insertion of these DNA sequences into an expression vector 
or following said insertion. 

In the examples of the present invention. Mi3mp8, 
described by Messing et al.. Gene (1982) 19:269. modified 
to contain an Xbal restriction site as shown in Fig. i , was 
chosen as a cloning vector along with Ml3mp9, \± The 
Ml3mp8 and Ml3mp9 vectors, collectively "Ml 3 vectors'*, 
allow for isolation of recombinant vectors in both double- 
stranded (ds) or replicative form (RF) and single-stranded - 
(ss)DNA forms. Isolation of RF DNA recombinant vectors 
facilitates the subsequent insertion of the replicated desired 
DNA sequences into expression vectors, as shown in Figs. 
5, 6 and 10. Alternatively, isolation of the single- stranded 
form of these recombinant vectors facilitates both isolation 
of recombinant vectors which contain the desired DNA 
sequence. in a proper 5* to 3' orientation for expression and 
the creation of any DNA sequence modification by such 
techniques as oligonucleotide-directed site-specific 
mutagenesis, as shown in Figs. 3, 4 and 8. Additionally, 
these Ml3 vectors can accommodate DNA fragments or 
genes up to 4kb in length which insures the cloning of a 
typical, entire, eucaryotic gene sequence. 

The marker function employed in the M13 vectors, as 
described by Messing et al.. Gene (1982) 19:269, involves 
the enzyme for 5-galactosidase- Specifically, the desired 
heterologous DNA sequence is inserted into the lacZ gene 
fragment carried on the Ml 3 vector which disrupts the 
normal complementation of the lacZ gene fragment carried 
on the M13 vector with the partial lacZ gene fragment 
carried on the chromosomal DNA of the host cell (e.g. 
E.coli JM101) so that said host is no longer capable of 
metabolizing lactose present in the bacterial growth medium. 
E. coli infected with Mi 3 vectors which do not have a 
foreign gene sequence inserted into the vector borne lacZ 
gene fragment are capable of metabolizing lactose present 
in the bacterial growth medium and yield characteristic blue 
plaques when the bacteria are grown on agar containing i x 
YT medium comprising 0.8% (w/v) tryptone. 0.5% (wA/) 
yeast extract, 0.5% (w/v) NaCi and a color indicator for 8- 
galactosidase. The plaque coloration of E.coli infected with 



recombinant vectors carrying an inserted heterologous DNA 
sequence in the Mi3 lacZ gene fragment is clear or color- 
less when the bacteria are grown on said medium. Hence, 
positive insertion of the heterologous DNA sequence into 

5 these cloning vectors is identified by coloriess plaque forma- 
tion following infection of the E^ coli host cell with the recom- 
binant vector. Insertion of the DNA sequences encoding 
bGH(L) and pGH(P) into Ml 3 vectors are shown in Figs. 2 
and 9. respectively. 

W In a preferred embodiment. bGH(L) and pGH(P) DNA 

coding sequences carried on bacterial plasmids pBGHex-i 
and pPGHex-1. respectively, as in Seeburg et al.. DNA - 
(1983) 2(i):37-45. were isolated from these plasmids by 
site-specific restriction endonuclease cleavage. It should be 

75 noted that bacteria transfected with bacterial plasmids 
pBGHex-1 or pPGHgx-i. respectively, and subsequently cul- 
tured under conditions which permit expression of the bGH- 
(L) and pGH(P) coding sequences, respectively, produce 
somatropins with an N-terminal methionine (e.g. met-bGH- 

20 (L) or met-PGH(P), respectively). The respective sequences 
were then inserted into RF DNA of a modified Mi3mp8 
vector {Mi3mp8/Xbal) and Ml3mp9 vector as shown in 
Figs. 2 and 7, respectively. Insertion of the desired bGH(L) 
and pGH(P) DNA sequences into Ml3mp8/Xbal and 

25 Ml3mp9 RF DNA was confirmed by site-specific restriction 
endonuclease cleavage as again shown in Figs. 2 and 7, 
respectively. E. coli JMlOi were then transfected with one 
of these recombinant vectors as described by Messing et 
al.. Methods in Enzvmoloov f 1983) 101:20 and the ssDNA of 

30 the recombinant vectors isolated as described by Messing 
et al., Gene(l982) 19:269. Relevant portions of these re- 
ferences by Messing et al. are incorporated herein by 
reference. 

Once isolated, the ssDNA's of these recombinant vec- 

35 tors were modified by oligonucleotide-directed site-specific 
mutagenesis to create the DNA coding sequences for bGH- 
(A.V), bGH(A.L). bGH(V) and pGH(A). Specifically. bGH- 
(L) was modified to add a codon for alanine, e^g, GCC. at 
the 5*-end of the bGH{L) coding sequence as shown in Fig. 

40 3. It is anticipated that any of the four codons for alanine 
may be so added. The preferred alanine codon for optimal 
somatotropin yield in the expression system employed here- 
in is GCC. Confirmation of the addition of an alanine codon 
to create a bGH(A,L) coding sequence was achieved by 

45 DNA sequence analysis of the entire or 5 '-end of the bGH- 
(A,L) and DNA sequence by the method of Sanger et al., 
Proc. Nat'l. Acad.Sci.. U.S.A. (1977) 74:5463. 

The bGH(A.\/) coding sequence was created by 
oligonucleotide-directed site-specific mutagenesis of the 

50 bGH(A,L) coding sequence as shown in Fig. 4, by convert- 
ing the leucine codon at amino acid position 127 [in bGH- 
(A,L)] to a valine codon, e.g. GTG. It is again anticipated 
that any codon for valine may be employed in this conver- 
sion. The creation of a bGH(A,V) coding sequence was 

55 again confirmed by DNA sequence analysis of the resultant 
bGH(A,V) coding sequence. 

The bGH(V) coding sequence which results in produc- 
tion of met-bGH(\/) proteins in bacteria transfected with an 
expression vector comprising the bGH(V) coding sequence 

60 was similarly created by oligonucleotide-directed site-spe- 
cific mutagenesis of the bGH(L) coding sequence by con- 
verting the leucine codon at amino acid position 126 [in 
bGH(L)] to a valine codon. e.g. GTG. 

The creation of a pGH(A) coding sequence by 

65 oligonucleotide-directed site-specific mutagenesis of the 
pGH(P) coding sequence was similarly performed as shown 
in Fig. 8 and described more fully below, and confirmed by 
DNA sequence analysis. 



8 



15 



0 193 515 



16 



Having now isolated and constructed the desired het- 
erologous ONA sequences, as exemplified for bG*H(A,L), 
bGH(A,\/) and pGH{A). these sequences may be replicated 
and numerous copy numbers generated by propagation of 
the respective recombinant vectors by methods known to 
those skilled in the art and referenced above. These het- 
erologous DNA sequences may not be inserted into any 
appropriate expression vector for production in procaryotes 
of the desired heterologous polypeptides. 

PRODUCTION OF N-TERMINAL ALANINE POLYPEP- 
TIDES 

As discussed previously, an appropriate expression 
vector should contain the necessary transcription and trans- 
lation signals for production of a heterologous protein in the 
chosen host cell along with a marker function for identifica- 
tion of those expression vectors into which the desired 
heterologous DNA sequence has been inserted. By use of 
a procaryotic expression vector, the recombinant DNA se- 
quences can be added to the genetic complement of a 
procaryotic organism via transduction, transformation or 
transfection (collectively referred to herein as 
" transfection" ) and said organism can then be cultured 
under conditions (generally governed by the promoter and 
host employed) that cause the desired polypeptide to be 
produced. Thus the "genomic" DNA of the organisms used 
in this invention contain both chromosomal and episomal 
DNA. 

A number of expression vectors have been described 
for heterologous gene expression and heterologous protein 
production in procaryotic host cells and are known to those 
skilled in the art. 

In one preferred embodiment of the present invention, 
expression vectors pBGH ex-i. see Seeburg et al., DNA - 
(1983) 2(1):37-45, and pBGH ex-i". comprising a modified 
pBGHex-i vector, were employed. 

The bGHex-i expression vector is a pBR322 bactertai 
plasmid carrying a gene for bGH(L). The gene comprises, 
in sequence, ■ a tryptophan promoter (ptrp), a Shine-Del- 
garno sequence, an ATG translation start/methionine codon 
immediately adjacent the N-terminal phenylalanine coding 
sequence, the first amino acid of the bGH{L) polypeptide, 
the bGH(L) coding sequence and a translation -stop codon. 
The marker function on the pBGHexo expression vector is 
antibiotic resistance. Specifically. pBGHex-i carries two anti- 
biotic resistance genes, one for ampicillin (amp') and the 
second for tetracycline resistance (tef) which confer spe- 
cific antibiotic resistance to otherwise susceptible host cells 
stably transformed with the expression vector. Thus, stable 
transformants may be selected for by growth in media 
containing either tetracycline, ampicillin or both antibiotics. 

In the examples of the present invention, expression 
vectors pMON3209 and pMON32l5 comprising pBGHex-i 
carrying bGH(A,L) and bGH(A,\/) coding sequences, re- 
spectively, in place of the bGH(L) coding sequence were 
generated as shown in Figs. 5 and 6. respectively. A 
bacteria such as E.coli . was then stabiy transformed with 
one of these expression vectors and transformants selected 
by growth on media containing the appropriate antibiotic. 
The expression plasmids contained within the transformed 
bacteria were then screened for the presence of the bGH- 
(A.L) and bGH(A,V) coding sequences in the correct 5' to 
3' orientation by restriction enzyme cleavage. 

In an example of the present invention, pBGHex-t' 
comprising the pBGHex-i vector modified by the removal of 
the EcoRl restriction site located upstream from the 5' -end 
of the ptrp coding sequence, was employed to create the 



expression vector pMON32l3 carrying the pGH{A) coding 
sequence in place of the bGH(L) coding sequence. The 
resultant pBGHex-i" vector contains only a single EcoRI site 
as shown in Fig. 9. The replacement of the pGH(A) coding 
5 sequences for the bGH(L) coding sequence in pBGHex-i ' 
to create the expression vector pMON32l3 is shown in Fig. 
10. E. coli were then transformed with said mixture and 
transformants selected by growth in antibiotic-containing 
media. The expression plasmids contained within the trans- 
10 formed bacteria were then screened for the presence of the 
pGH(A) coding sequences by restriction cleavage. 

Production of bGH(A,L), bGH(A.V) or pGH(A) in E, 
coli was achieved by transforming either E coli W3ii0, 
E. coli LE392 or E.coli strain 294, deposited and having 
;5 ATCC accession numbers 39936, 53010, and 53009, re- 
spectively, with one of the expression vectors. pMON3209, 
pMON32i5 or p/y/10N32i3 in accordance with the method 
described more fully below. The transformedE. coli W3ii0 
were then cultured under conditions which permit expres- 

20 sion of the somatotropin genes and production of the de- 
sired heterologous polypeptides. 

Purification of the heterologous peptide produced will 
depend on both the protein and host cell chosen. It has 
been observed, for example, that the heterologous proteins 

25 produced in such bacteria as E.coli are often precipitated 
within the cell in the form of "refractile" bodies. The term 
"refractile" is used because these bodies can actually be 
seen using a phase contrast microscope. A method useful 
in recovering heterologous proteins, in biologically active 

30 form is described in European Patent Application 114,506 - 
•(published August i, 1984) incorporated herein by refer- 
ence. Briefly, this purification method comprises concentrat- 
ing the host cells, lysing these cells to generate a cellular 
extract or homogenate and then isolating the refractile bod- 

35 ies by differential centrifugation, all the steps of which are 
known to those skilled in the art. The isolated refractile 
bodies are dissolved in a strong denaturant such as 
guanidine hydrochloride and the solubiiized protein is then 
exchanged in a suitable solvent (for example, urea), purified 

40 by chromatographic means, and finally biologically acti- 
vated, i.e., permitted to assume its active configuration and 
then oxidized so that such configuration is maintained by 
disulfide bonds between its appropriate cysteine residues, 
as described in European Application 114,506. A more 

45 detailed purification of such heterologous proteins is de- 
scribed in two con currently filed U.S. applications, one by 
S. 8. Storrs entitled "Method of Somatotropin Solubiliza- 
tion" and the second by L. A. Bentle, S. B. Storrs and J. 
W. Mitchell entitled "Method of Somatotropin Naturation". 

50 incorporated herein by reference. These two concurrently 
filed U.S. patent applications and the present application are 
all commonly assigned to Monsanto Company. Subsequent 
purification of the heterologous polypeptide to rid it of con- 
taminating bacterial proteins can be achieved by conven- 

55 tional chromatographic means such as gel filtration or ion 
exchange chromatography. Typically, the resulting purified 
compositions will contain, by weight, from about 90% to 
about 99.5% of the N-alanyl polypeptide and from about 
0.5% to about 10% protein native to the bacteria or other 

60 procaryotic host in which the polypeptide was produced. 

It has been found that usually at least about 80% of 
the heterologous peptide expressed by the method of this 
invention has an N-terminal structure of NHj-ala... . The 
remaining polypeptide is typically in the methionyl form, 

65 having an N-terminal structure of NH2-met-a]a... . By vary- 
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ing growth conditions and/or tinning of the induction of gene 
expression, however, it is possible to raise the proportion of 
polypeptide with alanine at the N -terminal to at least about 
95% or even greater. 

In a particularly preferred ennbodiment of the present 
invention, the somatotroptn polypeptide species produced 
and isolated as described above were shown to exhibit 
somatotropin-like biological activity as assayed in the rabbit 
liver receptor assay perfornned as described by Tsushima 
and Friesen, J. Clin. Endocrinol. Metab. (1973) 37:334-337 
and a rat weight gain bioassay. In the latter assay, the 
bioactivities of the coli produced somatotropins are as- 
sessed relative to a known lot of somatotropin (e. g. bovine 
or porcine pituitary somatotropin) by relating the amount of 
weight gain of hypophysectomized rats to varying amounts 
of injected compound. Specifically, titrated dosages - 
(between 0 and 60 micrograms) of the unknown or standard 
somatotropin source are injected into hypophysectomized 
rats (95-135 g) on a daily basis for 7 or more days. Using 
multiple regression, body weight gains of animals receiving 
the known and unknown hormone lots are regressed on log 
transformed dosages. The slopes are tested to insure non- 
parallelism and intercept commonality. Bioactivities are re- 
ported as the ratio of the slopes times the activity of the 
standard. 

Use of the N-alanine bGH products of this invention 
increases milk production by cows and are believed to 
decrease the amount of feed required for a given output of 
milk by such cows. Administration to dairy cows of a 
lactation-enhancing amount of bGH species containing a 
valine at or around position 126 of the mature bGH protein 
is particularly well suited for potentiating milk production by 
these cows. Such products of this invention may be admin- 
istered to cows by injection, infusion or implantation in 
polymers or other media known to achieve the delivery of a 
required dosage in the circulatory system. Pharmaceutically 
acceptable base formulations such as solutions, emulsions 
or gels may be used, either encapsulated or not. These 
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formulations may contain a single bGH species or variant 
thereof or any prescribed combination of naturally-occurring 
and/or variant polypeptides (e.g. a mixture of bGH(A,\/) and 
bGH(A.L) and/or a mixture of bGH(A,V) and met-bGH(A)]. 

5 Dosages may range- from at least about 0.005 mg to about 
200 mg per animal per day and preferably from about 5 mg 
to about 40 mg per animal per day. The amount ■ most 
effective for increasing milk production and/or feed-to-milk 
efficiency may be determined by routine experimentation. 

10 The actual preferred dosage of bGH is dependent on such 
variables as the size, general health and nutritional status of 
the specific animal. Although milk production can be used to 
assess the effect of bGH on cows, other production prop- 
erties of the cows, such as overall growth rate and meat 

75 production can also be employed. bGH can, if desired, be 
administered to cows with other beneficial agents such as 
other biologically active proteins, antigens, or the like, and 
thereby provide enhanced effects. 

As previously discussed, the invention also contem- 

20 plates the production of variants of bGH which have an N- 
terminal alanine but which have deletions, additions, inver- 
sions and/or substitutions along the polypeptide chain. Such 
modifications having desirable lactation and/or growth en- 
hancing prop)erties can be identified by routine testing in 

25 cows. 

The following examples illustrate preferred embodi- 
ments of the present invention and are not intended to limit 
the invention's scope in any way. While this invention has 
been described in relation to its . preferred embodiments, 
30 various modifications thereof will be apparent to one skilled 
in the art from reading this application- 
Microorganisms and Plasmids 

35 The following microorganisms are available from the 

American Type Culture Collection (ATCC). i230i Parklawn 
Drive, Rockville, Maryland, 20852, U.S.A.: 

40 

coli W3110 

coli LE392 

coli strain 294 

coli W3110 (pMON3209) 

coli W3110 (pM0N3215) 

coli W3110 (pM0N3213) 



These deposits are available to the public upon the 
grant of a U.S. patent to the assignee of this application, 
Monsanto Company. These deposits will be available for 
the life of any such U.S. patent having the benefit of the 
filing date of this application. However, it should be under- 
stood that the availability of a deposit does not constitute a 
license to practice the subject invention in derogation of 
patent rights granted by governmental action. Furthermore, 
the present invention is not to be limited in scope by the 
microorganisms deposited, since the deposited embodi- 
ments are intended only as specific illustrations of the 
invention. 



EXAMPLE 1 

All oligonucleotides were synthesized in the Depart- 
ment of Biological Sciences, Monsanto, employing an Ap- 
plied Biosystems DNA synthesizer in accordance with the 
procedure set forth by the manufacturer, Applied 
Biosystems, Inc., Foster City, California. Restriction en- 
zymes and DNA modifying enzymes were purchased from 
New England Biolabs (Beverty, Massachusetts), New Eng- 
land Nuclear (Boston, Massachusetts) and Beihesda Re- 
search Laboratories (BRL) {Gaithersburg, Maryland). An 
Xbal linker was obtained from Collaborative Research, Inc. 
(Lexington, Massachusetts). T4 DNA ligase was purchased 
from BRL (Gaithersburg, Maryland). T4 DNA kinase was 
purchased from New England Biolabs (Beverly, Massachu- 
setts). "P -labeled nucleotides were purchased from Amer- 
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sham (Arlington Heights, Illinois) E. coli DNA polymerase I. 
Klenow fragment, was purchased from New England Nu- 
clear (Boston, Massachusetts). E. coli JMiOi was obtained 
from. Dr. Jo Messing, University of Minnesota (St. Paul, 
Minnesota). 

Restriction enzyme digestions, the T4 DNA ligase re- 
actions, and E. coli DNA polymerase I, Klenow fragment, 
reactions may be carried out in accordance with the proce- 
dures set forth by the manufacturers. Preferred buffers for 
the following restriction enzymers are as follows: for Xbal: 
lOOmM NaCi, 50mM tris, pH 7.5, lOmM MgSO*; for Ec- 
oRI, Hind 111 and Sma 1: 50mM NaCl. lOmM Tris, pH 7.5, 
lOmM MgSOft. T4 DNA Itgase reactions were run in buffers 
containing 25mM Tris, pH 8.0, lOmM MgCl,. lOmM 
dithiothritol (DTT), 2mM spermidine and 0.2mM ATP. E. 
coiiDNA polymerase I, Klenow fragment, was used in a 
buffer containing 20 mM Tris, pH 7.2, lOmM MgClj, lOmM 
(DTT). imM ATP, and imM each dATP, dGTP. dCTP, 
dTTP. Alpha-"P-dATP (400 ci/mmol) was added to the 
Klenow reaction if labeling of the newly synthesized DNA 
strand was desired. 

Oligonucleotides were labeled using gamma-"P-ATP - 
(sp. act. greater than 5000 ci/mmol) and T4 DNA kinase in 
lOOmM Tris, pH 8.0. lOmM MgClj, 5mM DTT. 

Ptasmids carrying DNA coding sequences for bGH(L) 
and pGH(P) (pBGHex-i and pPGHexo. respectively) were 
obtained from Genentech, Inc., So. San Francisco, Califor- 
nia. These plasmids can be prepared as described in Eu- 
ropean Patent Application 75,444 (published March 30. 
1983); Seeburg et al., DNA (1983) 2(l):37-45; Goeddel et 
al. NatLire(l979) 281:544-548; DeBoer et al., in Promoters: 
Structure and Function (1982); M. J. Ghambertin and R. 
Rodriguez eds., Chapter 293; Miozzarr and Yanofsky, J. 
Bacteriol. (1978) 133: 1457-1466; and Rosenberg and 
Court, Annual Review of Genetics 13:319-353. As shown 
in European Application 75.444 (DeBoer et al), the first 21 
translated codons in the DNA for bGH(L) are ATG TTC 
CCA GCT ATG TCT CTA TCT GGT CTA TTC GCT 
AAC GCT GTT CTT CGT GCT CAG CAT CTT or func- 
tional equivalents thereof. (Functional equivalents of those 
codons can of course be used alternatively.) Other relevant 
portions of these publications are incorporated herein by 
reference. 

Ml3mp8 and M13mp9 were obtained from Dr. Jo 
Messing, University of Minnesota (St. Paul, Minnesota). 

All bacterial growth media components and antibiotics 
were obtained from either Sigma (St Louis, MO) or Difco 
Laboratories (Detroit Michigan). 

EXAMPLE 2 

The following example demonstrates the construction of 
three DNA coding sequences which, when expressed, pro- 
vide for the direct production in bacteria of polypeptides 
containing an N -terminal alanine. Specifically. DNA coding 
sequences were constructed such that the translation initi- 
ation start/methionine codon (ATG) is immediately followed 
by an alanine codon (e.g. GCC). The three DNA coding 
sequences, comprising bGH(A,L). bGH(A,V) and pGH(A), 
were constructed by oligonucleotide-directed site-specific 
mutagenesis of previously isolated somatotropin DNA se- 
quences. This example also demonstrates the construction 
of a bGH(V) DNA coding sequence which when expressed 
in bacteria causes production of met-bGH(V). 



a. Construction of a bGHfA.L) DNA Codino Sequence 

The somatotropin, bGH(L), DNA coding sequence was 
excised from pGHex-i as an Xbal fragment and cloned into 
5 the Xbal site of a modified Mi3mp8 vector (Mi3mp8/Xbal). 
Construction of the Ml3mp8/Xbal vector which contains an 
Xbal linker in the original Smal site is shown in Fig. i. As 
shown in Fig. 2, Xbal cleaves at either end of the bGH(L) 
DNA coding sequence thus excising the complete bGH(L) 

10 coding sequence. The Xbal restricted pBGHex-i plasmid 
was mixed in the presence of T4DNA ligase with RF 
Mi3mp8/Xbal DNA linearized by Xbal restriction enzyme 
cleavage and treated with calf intestine alkaline 
phosphatase. The mixture was then incubated overnight at 

15 ^4''0. Treatment with calf intestine alkaline phosphatase 
prevents recircularization of the Ml3mp8/Xbal vector. Inser- 
tion of the bGH(L) DNA coding sequence into the 
Mi3mp8/Xba) vector to create recombinant vector 
Mi3mp8/BGHex-i was initially ascertained by coloriess 

20 plaque formation on a lawn of bacteria. E. coli JMiOi, 
grown on 1 x YT medium employing the soft agar overiay 
procedure described in Molecular Cloning: A Laboratory 
Manual . Maniatis, Fritsch & Sambrook, eds. (1982) pg. 64. 
which included 10 al lOOmM IPTG (isopropyl-jS-D- 

25 thiogalactopyranoside) and 50 u\ 2% (w/v) X-GAL(5- 
bromo-4chloro-3-indolyl-^-D-galactopyranoside) in 3 ml of 
top agar, and transfected with said recombinant vector as 
previously described. Insertion of the bGH(L) coding se- 
quence was confirmed by cleavage of isolated RF DNA, 

30 Molecular Cloning: A Laboratory Manual . Maniatis, Fritsch 
& Sambrook, eds. (1982). Chapter 3, of the recombinant 
vector with Xbal which yields a 590 base pair fragment 
comprising the inserted sequence. The 590 base pair frag- 
ment was identified by agarose gel electrophoresis in one 

35 percent (w/v) agarose as described in Molecular Cloning: A 
Laboratory Manual . Maniatis. Fritsch & Sambrook, eds. 
(1982). All subsequent restriction fragments were identified' 
by this referenced method. The orientation of the inserted 
bGH(L) coding sequences was ascertained by cleavage of 

40 the Rf5- recombinant vector with Smal and Hindtll. When the 
coding sequences are in the correct 5' to 3' orientation, 
cleavage with these restriction enzymes should yield a 207 
base pair fragment The isolation of ss phage DNA was 
conducted in accordance with the method of Messing et at, 

45 Gene f 19821 19:269. The Ml3mp8/BGHex-i vector was 
then employed as a template in the oligonucleotide-directed 
site-specific mutagenesis essentially as described by Zoller 
and Smith, Nuc. Acids Res. (1982) 10:6487-6500, Zoller 
and Smith Methods in Enzymol. (1983) 100: 468-500. 

50 Norris et at. Nuc. Acid Res. (1983) 1 1:5 103-51 12, the 
relevant portions of which are herein incorporated by refer- 
ence. 

Figure 3 diagrams the mutagenesis procedure for cre- 
ation of a DNA coding sequence for bGH(A,L) from the 

55 bGH(L) coding sequence. Briefly, an oligonucleotide primer 
(see Table i , below) containing the sequence of the desired 
mutation is used to prime synthesis of a closed-circular 
DNA copy of the ssDNA M13mp8/BGHex-i template. The 
closed -circular dsDNA molecules thus generated are sepa- 

60 rated from incomplete and ssDNA circles by alkaline su- 
crose gradient centrifugation as described by Zoller and 
Smith, Methods in Enzymol. (1983) 100:468-500. The 
closed -circular dsDNA molecules are then used lo trans- 
form E. cQli JMIOI as described by Messing et al.. Gene - 

65 (1982) 19:269-276 and the resulting coloriess plaques are 
lifted onto Pall filters obtained from PaJI Ulirafine Filtration 
Corp. (Glen Cove, New York) and screened for hybridiza- 
tion to a "P -labeled form of the oligonucleotide primer used 
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to generate the site-specific mutagensis. The lifting of said 
plaques was conducted in accordance with methods de- 
scribed by the Pall Filter manufacturer. Hybridization 
screening was carried out using nylon Biodyne filters as 
described by Pall Uitrafine Filtration Corporation (Glenn 
Cove, N.Y.) in their "Protocol Guide for ON A Transfer to 
Pall Biodyne A Nylon Fitters" (1983). Filters were 
washed at increasing temperatures until the radiolabeled 
signal was removed from a control filter which was prepared 
with Ml3mp8/bGHex.i phage. A typical filter washing pro- 
tocol employed a room, temperature wash in 6xSSC (0.9M 
NaCi and 0.09M NaCitrate) for ten minutes followed by a 
50**C wash in SxSSC for five minutes and subsequent 
washings at temperatures increasing by 5*C. Plaques 
which hybridized to radiolabeled oligonucleotide primer at 
temperatures higher than the control phages were pre- 
sumed to carry the newly created bGH(A,L) coding se- 
quence and were termed potential positives. Alternatively, 
individual colorless plaques were picked from the E. coli 
JM101 transformations and grown in 5 milliliters (ml) of 2 x 
YT medium (1.6% (wA/) tryptone. 1,0% (w/v) yeast extract, 
0.5% (w/v) NaCl) overnight at 37 °C with aeration. Phage 
DNA, prepared in accordance with Messing et al.. Gene - 
(1982) 19:269, was then spotted onto nitrocellulose, 
hybridized with radiolabeled primer, and washed in increas- 
ing temperatures as described above. Phage DNA which 
showed hybridization temperatures higher than 
Mi3mp8/bGHex-i control plaques were similarly termed po- 
tential positives. Potential positive plaques from both 
screening procedures were grown as described above and 
used to prepare ss phage DNA, which was then sequenced 
according to the procedure of Sanger et al. Proc. Nat' I 
Acad. Sci. . U.S. A (1977) 74:5463 to confirm that they 
carried the bGH(A,L) coding sequence. The RF DNA of 
Mi3mp8/BGHex-t(ala) was also screened by restriction en- 
zyme analysis with Hae III to confirm the addition of an 
alanine codon following the start signal/methionine codon 
ATG as the alanine codon creates an additional Hae 111 
restriction site. The frequency of addition of the alanine 
codon was about 2% to the bGH(L) DNA coding sequence. 

b. Construction of a bGH(A.V)DNA Codino Seouence 

Construction of the bGH(A,V) DNA coding sequence, 
screening and sequence confirmation were carried out using 
the same procedure as above except that the template was 
Ml3mp8/BGHex-i(ala). as shown in FIG. 4, and the 
oligonucleotide primer was as shown in Table i , below. The 
frequency of conversion of the leucine codon to a valine 
codon was about 10%, 

c. Construction of a bGHfV) DNA Coding Sequence 

Construction of the bGH(V) DNA coding sequence, 
screening and sequence confirmation were carried out using 
the same procedure as above. Specifically, the template 
was Mi3mp8/BGHex.i and the oligonucleotide primer was 
the same as employed to create bGH(A,V). The frequency 
of conversion of the leucine codon to a valine codon was, 
again, about 10%. 



d. Construction of a oGH(A.) DNA Coding Sequence 

Oligonucleotide-site-specific mutagenesis was similarly 
employed to add an alanine codon to the pGH(P)DNA 

5 coding sequence described by Seeburg et aL, DNA (1983) 
2(i):37-45. The mutagenesis procedure as diagramed in 
FIG. 8 was conducted as follows. 

The 590 base pair pGH(P) DNA coding sequence 
carried on the pPGHex-i plasmid was removed from the 

W plasmid by restriction enzyme cleavage with EcoRI and 
Hindlll which cleave the plasmid at the 5'-and 3'-ends of 
the pGH(P) coding sequence, respectively, as shown in 
FIG. 7. The restricted pPGHgx.i plasmid was then mixed 
with Mi3mp9 RF DNA similarly cleaved with EcoRl and 

15 Hindlll but additionally pretreated with calf intestine alkaline 
phosphatase to prevent religation of the Mi3mp9 restriction 
fragments. T4 DNA ligase was then added to said mixture. 
Owing to the abberrant ends on both the RF phage DNA 
and pGH(P) DNA coding sequence created by the use of 

20 two different restriction enzymes, the pGH{P) DNA will 
selectively be inserted in the RF phage DNA and will be 
inserted in the correct 5' to 3' orientation as shown in FIG. 
7, coli JM101 were then transformed, as described 
above for Ml3mp8/BGHex-i. with the recombinant 

25 Ml3mp9/PGHex.i vector carrying the pGH(P) DNA coding 
sequence. The transformed E coUMiOl were then grown 
on, 1 X YT medium containing colorimetric reagents and 
selected by coloriess plaque formation as described pre- 
viously. Insertion of the pGH{P) DNA coding sequences 

30 was confirmed as follows. Colorless plaques were picked 
and RF lvll3mp9/PGHex-i DNA, isolated as previously de- 
scribed, was then cleaved with EcoRI and Hind III and 
subjected to agarose gel electrophoresis yielding a 590 
base pair fragment comprising the inserted pGH(P) DNA. 

35 Mi3mp9/PGHex-i phage were then propagated in 

coliJMiOi and the ss phage DNA isolated as described 
above. 

The fvtl3mp9/PGHex.i DNA was then employed as a 
template for the oligonucleotide-site-specific mutagenesis, 

40 shown in FIG. 8, in accordance with the procedure de- 
scribed above for creation of a bGH(A,L) coding sequence, 
employing a specific primer (see Table i, below). The 
frequency of addition of an alanine codon, herein GCC, to 
the pGH{P) coding sequence was approximately 12%. The 

45 resultant pGH(A) coding sequence was again confirmed by 
DNA sequence analysis. 
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EXAMPLE 3 

This example demonstrates the construction and ex- 
pression of three recombinant expression vectors which 
provide for the direct production in bacteria of poly peptides 
containing an N-terminal alanine. The three polypeptides so 
produced are the somatotropin species bGH{A,L). bGH- 
(A.V) and pGH(A). This example also demonstrates the 
construction and expression of bGH(V) and expression of 
bGH(L). The polypeptides so produced are met-bGH{V) and 
met-bGH(L), respectively. 



55 a Expression of bGHfA.L) bGHfA.V). bGHfL) and bGHfV) 

The bGH(A,L). bGH(A,V) and bGH(V) DNA coding 
sequences carried on recombinant vectors Mi3mp8/BGHex. 
1 (ala). M 1 3mp8/BGHex-i {ala.val), and M 1 3mp8/8GHex-i - 

60 (val), respectively, were used to replace the bGH(L) DNA 
coding sequence carried on the pBGHgx-i expression plas- 
mid (see FIGS. 5 and 6). This was done by digestion of the 
respective M13 RF ONA's with Xbal. Expression plasmid 
pBGHex-i was also digested with Xbal and subsequently 

65 treated with calf intestine alkaline phosphatase to prevent 
reiigation of the restriction framents. Each of the digested 
RF ONA's were then separately mixed with the digested 
and treated pBGHex-i DNA and ligated as described pre- 



13 



25 



0 193 515 



26 



vioysly overnight at 14°C- The recombinant expression 
vectors thus fornned were labeled pMON3209, pMON32i5 
and pM0N32l4 carrying the bGH(A,L). bGH(A,V), and 
bGH(V) DNA coding sequences, respectively. E. coli 
W3110 were then transformed with the ligation mixture 
containing pMON3209 or pMON3215 or pM0N3214 or 
pBGHex-i and grown in Lauria Broth (LB) comprising i% - 
(w/v) tryptone, 0.5% (w/v) yeast extract and 0.5% (w/v) 
NaCl and containing 12.5 ug/ml tetracycline and 200 ug/ml 
ampiciilin. E. Qoli W3110 containing pMON3209 have 
ATCC accession number 53024. E. coli W3 110 containing 
pMON32l5 have ATCC accession number 53022. Trans- 
formation was conducted, briefly, as follows. Approximately 
50 ml of E coliW3l 10 were grown in LB media to an 
OD600 = 0.60. The cells were then pelleted and resuspen- 
ded in 10 ml Buffer A comprising 25mM Tris. pH 7.6, and 
lOmM NaCI. The cells were then pelleted and resuspended 
in 1 ml Buffer A to which 14 ml Buffer 8 comprising 25mM 
Tris, pH 7.6. lOmM NaCI, 50 mM CaCU, was added and 
the suspension incubated on ice for 30 minutes. The cells 
were then pelleted and resuspended in 3 ml Buffer B. An 
aliquot of 0.2 ml of resuspended cells was then mixed with 
0.1 ml Buffer B and 0.1 to 0.5 ug of the desired recom- 
binant expression vector (pMON3209 or pfviON32l5 or 
pMON32i4 or BGHex-0 and incubated on ice for 60 min- 
utes. The incubated mixture was then heated for one minute 
at 37*'C after which time 3 ml LB media was added and 
the resultant mixture then incubated 60 minutes at 37 °C. 
The cells were then pelleted and resuspended in 300 ml LB 
media and grown on LB plates containing antibiotics as 
previously described. Resistant colonies were then selected 
and the pMON3209. pMON32l5, pBGHex-i and 
pM ON 32 14 expression vector DNA's isolated in accor- 
dance with the procedure described in Molecular Clonings: 
A Laboratory Manual . Maniatis, Fritsch & Sambrook, eds. - 
(1982). The PMON3209, pMON3215, pBGHex-i and 
pMON32i4 DNA's were screened for the presence of a 
590 base pair Xbal fragment and a 200 base pair Hind 
IH/Smal fragment showing the presence of the bGH(A,L), 
bGH(V) and bGH(A,V) DNA coding sequences in the cor- 
rect orientation. The pMON3209 and pMON32l5 expres- 
sion plasmids were further screened by restriction analysis 
with Haelll to confirm the presence of* a new Haelll site 
resulting from the addition of a GCC (alanine) codon. Finally, 
the 590 base pair Xbal fragment from the pMON3209, 
pMON32i4 and pMON32i5 vectors were partially sequen- 
ced as described previously to confirm the presence of the 
bGH(A,L), bGH(V) and bGH(A,V) DNA coding sequences 
in these expression vectors. 

Single colonies of E. coliW3l 10 carrying pMON3209, 
pMON32i4, BGHex-i. Or pMON32i5 were inoculated sep- 
arately into 5 ml of LB containing 12.5 ug/ml tetracycline 
and grown overnight at 37 *C with aeration, 0.5 ml of the 
overnight cultures were then used to separately inoculate 
25 ml of M9 media comprising (per liter of media) 100 ml 
10X salts (70 grams (g) Na,HP04, 30g KH,PO.. 5g NaCI, 
lOg NH^Ct per 1000 ml total volume) 1.2 ml 1M MgSO.. 
0.25 ml 0-1% B„ 12.5 ml 20% (w/v) glucose, 0.025 ml 1M 
CaCl,, supplemented with 0.5% (w/v) casamino acids and 
5.25 ug/ml tetracycline, contained within 250 ml flasks. The 
inoculants were then each grown at 37**C with aeration until 
reaching an OD600 (optical density at 600 nanometers) - 
= 1 .0- An aliquot of 0.2 ml was next removed from each of 
said flasks and individually lysed in sodium dodecyl sulfate - 
{SDS)-potyacryfamide gel electrophoresis (PAGE) buffer 
and assayed by SDS-PAGE in accordance with Laemmli, 
Nature (1970) 227:680-685. Proteins Of 22.000 daltons 
were at high levels in E. csli W3ii0 from both of the two 



gene constructs. Furthermore, both 22.000 dalton proteins 
bound in Western blot assays (see Krivi and Rowold, 
Hvbridoma (1984) 3:252-262) to a monoclonal antibody, 
Fn-Ai-B6, (see id) produced against bovine pituitary 

5 somatotropin, thereby confirming that the polypeptides are 
related to pituitary somatotropin. E.coli W3i i0 cells parrying 
the parental pBR322 plasmid do not produce a protein of 
22,000 daltons that binds to anti-somatotropin antibodies. 
Bacteria carrying the expression plasmids pMON3209. 

10 pMON32l4, pBGHex.i and pMON32l5 were stored as 
follows. A single colony of E.coli W3110, transformed with 
one of the plasmids (pMON3209 or pMON32l5 or 
pM0N32l4 or pBGHex'.i). was grown overnight at 37**C in 
5 ml LB plus 12.5 ug/ml tetracycline with aeration. A 1 ml 

75 aliquot from each overnight culture was added separately to 
individual flasks containing 25 ml LB plus 12.5 ug/ml 
tetracycline and grown to an OO600 = 1 .0. The cells from 
each flask were then harvested by centrifugation at 6000 x 
g at 4*0 for 5 minutes. The pellets were individually 

20 resuspended in 12 ml LB plus 7.5% (v/v) DMSO and 
immediately frozen on dry ice in i ml aliquots. The cells 
were then stored in a liquid nitrogen freezer. Additionally, 
about 1 0 micrograms of purified plasmid DNA was stored at 
-80**C. 

25 

b. Expression of oGH(A) DNA Codino Sequence 

As shown in FIG. 10. the pGH{A) coding sequence 
carried on the recombinant vector Ml3mp9/PGH ex.i(ala) 

30 was used to replace the bGH(L) DNA coding sequence 
carried on a modified pBGHgx.i expression vector. The 
modified expression vector, pBGHex-T, comprises a pBG 
Hex-i expression vector wherein the EcoRI restriction site 
located upstream from the tryptophan promoter (ptrp) has 

35 been removed (see FIG. 9). The modified pBGH ex-i" 
expression vector was created by partial EcoRl digestion of 
pBGHex.r followed by SI nuclease treatment to remove the 
sticky ends. The restricted pBG H ex- 1 vector was then recir- 
cularized by use of T4 DNA ligase and used to transform E. 

40 coli JM101. all as previously described. Plasmid DNA from 
single colonies was next screened for a 590 base pair 
EcoRI/HindlH fragment carrying the pGH(A) coding se- 
quence and a 1050 base pair EcoRI/Pstl fragment carrying 
the ptrp sequence (See FIG. 9). Removal of this EcoRI 

45 restriction site facilitates the site specific insertion of the 
pGH(A) coding sequence into the pBGHex-i' expression 
vector in the proper orientation as shown in FIG. 10. The 
recombinant expression vector thus formed is hereinafter 
referred to as pMON32i3. The mixture containing 

50 pMON32i3 was then used to transform E.coli W3110 and 
the transformants grown and selected for as described 
above. E coliW3i 10 containing pMON32l3 have ATCC 
accession number 53023. Replacement of the pBGH(L) 
coding sequence with a pGH(A) coding sequence in the 

55 pBGHex-i " expression vector was confirmed by isolating 
pMON32i3 ONA and then cleaving said expression vector 
with EcoRI and Hind III to generate a 590 base pair 
fragment and cleavage with Hae III to show the presence of 
an additional Hae 111 restriction fragment generated by the 

60 presence of an alanine codon in the pGH(A) ONA coding 
sequence. Final confirmation of the presence of the pGH- 
(A) ONA coding sequence in the pMON32i3 expression 
vector was made by partially sequencing the EcoR I/Hind III 
590 base pair fragment as previously described. 

65 
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Expression of the pGH{A) DNA coding sequence and 
production of pGH(A) in E coliW3l 10 was achieved in the 
^ manner set forth for bGH(A,L) and bGH(A.V) production 

described above. Similarty, high levels of a protein of 
r 22.000 daltons. demonstrable by SDS/PAGE as described 

above, were achieved. 

E. coli W3110 carrying the pMON32l3 expression 
vector were stored as previously described and employed in 
large scale (10 to 100 liter fermentation) production of pGH- 
-i (A) also as previously described. The pGH(A) protein con- 

tent in the 100-liter batch fermentation was, again, approxi- 
* mately 1 g/liter of broth by radioimmunoassay of Rosner et 

ai.. J. Immunol. Methods (1982) 52:175-181. 

EXAMPLE 4 

X This example was performed in order to determine the 

I N -terminal amino acid sequence of the heterologous pro- 

teins, bGH(A.L), bGH(A,V), met-bGH(V). met-bGH(L) and 
pGH(A), produced in bacteria as examples of the method of 
the present invention. 

The somatotropin polypeptides produced in E^^ coli 
were purified from crude solubilized refractile bodies con- 
taining either bGH(A.L), bGH(A.V), mei-bGH(V), met-bGH- 
(L) or pGH(A) by immunoadsorbent chromatography as 
described by Krivi & Rowold, Hybridoma (1984) 3:151-161. 
All three somatotropin species purified by immunoadsorbent 
chromatography appeared to be greater than 95% pure by 
^ SOS-PAGE analysis of i ug purified protein run on a 7.5 to 

^ 15% (w/v) gradient gel performed in accordance with the 

method of Laemmli, Nature (1970) 227:680-685. Protein 
concentrations of the purified bGH species were determined 
by routine high performance liquid chromatography analysis. 
Proteins purified by immunoaffinity chromatography for 
■ use in N- terminal sequence analysis were dialyzed exhaus- 

tively against water and then lyophilized. Prior to N -terminal 
sequence analysis, the purified proteins were resuspended 

iin ammonium bicarbonate buffer comprising 50mM ammo- 
nium bicarbonate plus 0.1% (w/v) SDS and dialyzed 
against the same buffer to remove residual tris- 

((hydroxymethyl) amino methane(tris) and glycine. An Ap- 
plied Biosystems Protein Sequencer Model 470A (Applied 



Bibsystems, Inc. Foster City, CA) was then employed for all 
N- terminal sequencing analysis, as described by Hunkapiller 
et al. (1983), Methods in Enzvmol. 9i :399-4i3 and Hun- 
kapiller et al. (1983), Methods in Enzvmol. 91:486-493. 

5 Table 2, below, shows the results of the sequence 

analysis for several preparations of bGH(A,L). bGH(A,\/), 
met-bGH(V), met-bGH(L) and pGH(A) polypeptides. The 
amount of protein with N-terminal methionine is shown in 
the table as a percentage of the total somatotropin in the 

10 sample. Two methods were used for methionine quantifica- 
tion. With the indirect method the amount of NH2-met-aia- 
phe... in a predominantly NH2-ala-phe.. population was 
calculated from differences in lag signals. Since this proce- 
dure depends on an estimate of "normal" sequence lag, 

75 which varies from cycle to cycle, it is only a crude estimate 
of the NH2-ala-phe... sequence present. The direct method 
comprising an Edman degradation reaction compared signal 
strength of PTH-met to PTH-ata after separation of PTH- 
met from chemical noise by high performance liquid 

20 Chromatography (HPLC). "PTH" refers to phenyl-thiohydan- 
toin. Specifically, the Edman degradation sequencing reac- 
tion consists of reacting the N-terminal amino acid with a 
reagent that causes cleavage of that amino acid and its 
subsequent release as PTH -derivative of that amino acid. 

25 The latter procedure will give good estimates of % met-ala- 
phe... as long as free amino acid contamination is low. 

As Shown by the N-terminal sequencing results pre- 
sented in Table 2, below, no evidence for methionine pro- 
cessing is seen when the N-terminal methionine is followed 

30 by phenylalanine. However, 80% or more of the molecules 
produced from the MBS(A,L) and MBS(A,V) gene con- 
structs have alanine rather than methionine at the N-termi- 
nal. The extent of N -terminal methionine processing is vari- 
able in cells from different fermentation runs, but always 
35 occurs in at least 80% of the somatotropin molecules. 
Additionally, production levels of approximately 10 to 15% 
of the total bactenal protein were achieved for the 
somatotropins produced in the transformed microorganisms. 

40 

TABLE 2 



45 



50 



55 



60 



65 



15 



29 



0 193 515 



30 



N-Terminal sequence analysis of met-bGH(L), bGH(A,L) , 
bGH(A,V) and pGH(A) proteins, 



Sample 



% N-Terminal Methionine^ 
Indirect Direct 



met-bGH(L) 
bGH(A,L) 



92.0 

20.32 

10.8 



100.0 
18. 42 
<6.0 



bGH(A,V) 



7.52 
16.7 



<3.02 
9.0 



pGH(A) 



16.7 



17.0 



met-bGH(V) 



not done 



100.0 



1. The amount of protein with N-terminal methionine is 
shown as a % of the total somatotropin in a sample. 

2. The two numbers represent the data for protein purified 
from 2 independent fermentations. 



EXAMPLE 5 

This example was pertornned in order to determine the 
ability of the bGH species produced in bacteria to stimulate 
milk production in dairy cows. As shown in Table 3 below, 
both bGH {A,V) and met-bGH(V) unexpectedly stimulated 
milk production to a greater extent than their respective 
leucine containing analogues, bGH(A,L) and met-bGH(L). 
Additionally, the valine bGH species tested when compared 
collectively against the leucine bGH species tested showed 
a statistically (p = .02) greater enhancement of milk produc- 
tion. 

The study was conducted briefly as follows, Holstein 
cows in their second and third trimester of lactation were 
given 5 milliliters (ml) sodium bicarbonate solution, pH 9.8 1 
.5, alone (herein referred to as "control sample") or con- 
taining about 25 milligrams (mg) of bGH(A,V), bGH(A.L). 
met-bGH(V) or met-bGH{L) {herein collectively referred to 
as "test somatotropins"), daily. All test somatotropins and 



40 



45 



50 



55 



control samples were administered parenterally by in- 
tramuscular injection into the semitendinosus muscle daily 
for 21 consecutive days. The individual milk production of 
each cow was recorded at each PM and AM milking 
beginning six days prior to the first daily injection through 
day 21 of injection. Milk fat, protein and somatic cells were 
aniayzed weekly by the DHIA Test Laboratory at DHIA 
Testing Center, Springfield. Missouri, 65803. The results 
presented in Table 3, beJow indicate that the increase in 
milk production resulting from administration of bGH {A,V) 
to cows was about 50% greater than that obtained using 
bGH(A,L), and about 17% greater for met-bGH(V) as com- 
pared to met-bGH(L). Such unexpected productivity advan- 
tages provided by the valine bGH species of this invention 
are clearly of very great potential economic benefit to 
dairymen. 
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Claims 



1. A composition comprising at least one bGH species 
containing a valine at or around position 126 wherein the 
concentration of said species, based on the weight of ail 
bGH in said composition, is substantially greater than the 
concentration of similarly valine-containing species in pooled 
pituitary bGH, 

2. A composition of Claim 1 wherein essentially all bGH is 
said valine-containing species. 

3. A composition of Claim 1 suitable for administration to 
cows. 

4. A composition of Claim 1 which, when administered to a 
cow, provides an enhancement of lactation by that cow 
exceeding that obtainable by similarly administering an 
equimolar amount of pooled pituitary bGH. 

5. A composition of Claim 1 which, when administered to a 
cow, provides an enhancement of lactation by that cow 
exceeding that obtainable by similarly administering an 
equimolar amount of bGH(A,L). 

6. A composition comprising essentially pure valine bGH 
species. 

7. The composition of Claim 6 wherein the valine bGH 
species comprises bGH(A.V). 

8. A composition comprising essentially pure valine bGH 
species able to enhance milk production in dairy cows to a 
measurably greater extent than an equimolar amount of an 
otherwise-identical leucine bGH species. 

9. A composition of Claim 8 wherein the essentially pure 
bGH is selected from a group consisting of bGH(A,V) and 
met-bGH(V). 

10. A composition comprising a valine bGH species pro- 
duced in a microorganism by recombinant DNA techniques. 

1 1 . A method for enhancing lactation of a cow, comprising 
administering to said cow a lactation-enhancing amount of a 
composition of Claims 1 , 4, or 5. 

12. A method for enhancing lactation of a cow, comprising 
administering to said cow a lactation -enhancing amount of a 
composition of Claims i. 2, 6, or 10 wherein the resulting 
lactation enhancement exceeds that obtainable by sepa- 
rately administering pooled pituitary bGH in an equimolar 
amount 

13. A method for enhancing lactation of a cow, comprising 
administering to said cow a lactation -enhancing amount of a 
composition of Claims i, 6, or 10 wherein the resulting 
lactation enhancement in said cow exceeds that obtainable 
by separately administering an equimolar amount of an 
otherwise identical leucine bGH species. 

14. A method of enhancing milk production in cows com- 
prising administering a composition comprising a lactation- 
enhancing amount of an essentially pure valine bGH spe- 
cies. 



15. A method of Claim 14 wherein the valine bGH species 
comprises bGH(A,V). 

16. A method for bacterially producing a heterologous poly- 
5 peptide having an N -terminal alanine which comprises caus- 
ing expression of genomic DNA in selected bacteria, said 
ONA containing the codons for said poly-peptide imme- 
diately preceded by from one to about three contiguous 
methionine codons including a translation start 

70 signal/methionine codon, and recovering the resulting poly- 
peptide having an N-terminal alanine produced within said 
bacteria 

17. The method of Claim 16 wherein a minimum of about 
15 80% by weight of the polypeptide produced within said 

bacteria from expression of said DNA contains an N-termi- 
nal alanine. 

18. The method of Claim 16 wherein the bacteria are E 
20 coli . 

19. The method of Claim 16 wherein said resulting polypep- 
tide contains an amino acid sequence essentially the same 
as that of a naturally occurring eucaryotic polypeptide. 

25 

20. The method of Claim 19 wherein said eucaryotic poly- 
peptide has somatotropin activity. 

21. The method of Claim 19 wherein said resulting potypep- 
30 tide has lactation-enhancing activity. 

22. A polypeptide having an N-terminal alanine prepared by 
the method of Claim 16. 

35 23. A composition comprising a polypeptide selected from 
bGH(A,L) and bGH(A,V) prepared by the method of Claim 
19, said composition being substantially free of protein of 
bovine origin. 

40 24. A composition comprising pGH(A) substantially free of 
protein of porcine origin and prepared by the method of 
Claim 19. 

25. A composition comprising pGH(A) substantially free of 
45 protein of porcine origin. 

26. The composition of Claim 25 useful in potentiating 
porcine pre- adult growth. 

50 27. A recombinant ONA vector selected from a bacterial 
ptasmid and a bacteriophage each having inserted therein 
DNA containing from one to about three contiguous 
methionine codons followed immediately by the codons for 
a polypeptide selected from bGH{A,L), bGH(A,V) and pGH- 

55 (A), said methionine codons including a translation start 
signal/methionine codon and said vector being capable of 
directing expression of said DNA in bacteria 

28. A gene comprising a promoter that functions in bacteria 
60 a ribosome binding site, from one to about three methionine 

codons followed immediately by a sequence coding for a 
heterologous polypeptide having an N-terminal alanine fol- 
lowed by a translation stop signal, said methionine codons 
including a translation start signal/methionine codon. 

65 

29. A method for enhancing lactation of a cow comprising 
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administering a lactation-enhancing amount of bGH(A,L) 
substantially free of protein of bovine origin. 
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FIGURE 5 
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FIGURE 6 
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FIGURE 7 
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FIGURE 9 
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FIGURE 10 
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